The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the
basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 351(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 

Claims 1, 2, 11 and 12 are rejected under 35 U.S.C. 102(e) as being anticipated by Paik et
al. (U.S. Patent 6,148,312). 

Claim 1: Reference is made to FIG. 4. Step (402) involves the extraction and examination
of text (contents of articles) and text fact data (metadata). The metadata has at least three parts:
(1) a target object (article content in the metadata, col. 6, line 43); (2) an attribute name (the title
of the article in the metadata, col. 6, line 42); and (3) an attribute value (versions of the article
which are archived, col. 3, lines 15-16).

As seen in FIG. 4, steps (404-406), an article that is missing metadata will have metadata
created for it. This is a grouping of data (metadata) with an article. Doing this step for
all the articles is a step of data aggregation. The complete resulting set of information, including
all metadata, corrected articles and non-corrected articles, forms an aggregated data set.

As seen in FIG. 4, step (408), the system detects an inconsistent data group (metadata that
lacks an associated article of information) by scanning the data set after aggregation (404-406) has
occurred. The system then erases the erroneous metadata and determines the remaining data to
be correct by proceeding to the "Return" step. The "Delete Metadata" step aids in unifying the
correct metadata with the correct article.

Claim 2: Step (402) is the data extraction unit. Metadata extracted includes target object
(article content); attribute name (title of article) and attribute value (versions of article which
have been archived). Step (406) is the data aggregating unit since it creates new metadata when
it does not exist and aggregates the data with each article. All the resulting data is the "one data set".
Step (408) is the inconsistency detecting unit which detects metadata that is not associated with
an article. The correctness determining unit is the process step "Return" since only correct
metadata associated with correct articles is determined to exist at this step. The data integrating
unit is the "Delete Metadata" step since this step aids in integrating only the correct metadata
with the correct articles by deleting metadata lacking any association with an article.
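
For illustration only, the reading of FIG. 4 applied above (steps 402, 404-406, 408, "Delete Metadata" and "Return") can be sketched in Python. This is a minimal sketch under stated assumptions, not the implementation disclosed in Paik et al.: the names Metadata, reconcile, article_id and the rule for deriving a title from the first line of an article are hypothetical and are introduced here only to make the sequence of steps concrete.

```python
from dataclasses import dataclass, field


@dataclass
class Metadata:
    # All field names are hypothetical; they mirror the three parts
    # identified in the rejection, not identifiers from the reference.
    article_id: str                                # link to the target object (article)
    title: str                                     # attribute name (title of the article)
    versions: list = field(default_factory=list)   # attribute value (archived versions)


def reconcile(articles, metadata):
    """articles: dict of id -> article text; metadata: dict of id -> Metadata."""
    # Step 402: examine the articles and whatever metadata already exists.
    aggregated = dict(metadata)

    # Steps 404-406: create metadata for every article that lacks it.
    for article_id, text in articles.items():
        if article_id not in aggregated:
            # Assumption: the first line of the article serves as its title.
            first_line = (text.splitlines() or [""])[0]
            aggregated[article_id] = Metadata(article_id, first_line)

    # Step 408: detect metadata that has no associated article.
    orphans = [article_id for article_id in aggregated if article_id not in articles]

    # "Delete Metadata": remove the inconsistent entries.
    for article_id in orphans:
        del aggregated[article_id]

    # "Return": only metadata associated with an existing article remains.
    return aggregated, orphans
```

In this sketch the dictionary named aggregated plays the role of the aggregated data set, and the list named orphans holds the inconsistent data group found at step (408).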

Claim 11: See remarks for claim 1.

Claim 12: See remarks for claim 1. Note that the grouped fact data is completed at steps 
(404-406) so the inconsistency detection step (408) must occur on the grouped fact data.

Claims 3-10 are allowed.

Remarks 

Applicant's arguments and amendments have been entered and considered. 

Claim 1: Applicant's primary argument is that the aggregated data set is that data set 
which is formed from extracted data, not a combination of extracted fact data and text. This 
argument has been considered, but is not consistent with what is being claimed. Claim 1 calls for 
"...at least one aggregated data set from the extracted fact data...". This limitation does not state
that the aggregated data excludes combinations of extracted data and text data; it only states that
the aggregated data set includes at least some extracted data. In Paik et al., the aggregated data set
created at steps 404-406 includes the extracted data (contents of articles) and, when necessary,
newly created text data (metadata) which are aggregated together. This is consistent with what is
being currently claimed. The aggregated data set also includes articles which already have
metadata, so in this alternate case, both the article data and metadata are the extracted data.

Applicant also argues that Paik et al. does not disclose the extracted data including target 
object, attribute name and attribute value. This argument is not correct. Each of these features is
taught in Paik et al., as outlined in greater detail on page 2, in the first paragraph of the
discussion for claim 1.

Claim 2: Applicant argues that the articles which are extracted and examined in step 
(402) do not constitute a "data set". This argument is not correct. An article contains information
and that information constitutes data. The descriptions of the article (metadata), such as the title
of the article or version of the article, also constitute data.

Applicant argues that Paik et al. does not disclose the two separate operations of 
detecting inconsistency and determining correctness/incorrectness. This argument is not correct 
for two reasons. First, claim 2 is addressed to an apparatus and not a method, so it is not actually
claiming operations (method steps), but is claiming individual sets of instructions. Second,
Paik et al. has two separate instructions which meet the claimed limitations. Paik et al. has an
instruction for inconsistency detection (step 408, detecting articles without metadata) and a
correctness determining instruction ("Return", which returns only a set of correct metadata associated
with correct articles).

Claim 11: Applicant's arguments for this claim reflect those arguments which were
already presented for claim 2.

Claim 12: Applicant argues that Paik et al. does not disclose any form of scanning
operation on grouped fact data. This argument is moot. Claim 12 contains no mention of 
scanning information. 

This rejection is made in response to an RCE request and is made non-final.

Any inquiry concerning this communication should be directed to Sam Rimell at 
telephone number (571) 272-4084.

Primary Examiner